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BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to a system and method for analyzing a user query or 
natural language query and generating a results and related questions. 

More particularly, the present invention relates to a system and method for enhancing 
information retrieval from a user posed query (Boolean or natural language) including 
determining keywords associated with the query, producing a result corresponding to the 
query, generating terms related to the keywords, supplying the keywords and terms to a data 
mining routine, generating related results and/or information and questions associated with 
additional results and/or information related to the query, and displaying the results and 
questions, which the user can then activated and/or investigated. 

2. Description of the Related Art 

Current web searching generally involves construction of a query by a user that is then 
sent via an information infrastructure such as the internet or world wide web to an 
application site for processing. The processing site, typically a search engine site, then 
obtains a set of sites on the infrastructure that have information relating or corresponding to 
the query. The search engine site can also rank the information containing-sites relative to 
some particular internal ranking procedure. However, search engines and the sites devoted 
to them are currently ill prepared to take advantage of information deposited in large 
database especially multi-dimensional database such as OLAP database and are ill prepared 
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to delve deeply into data to find other information that may be of interest to a user. 

This other information is generally contained in databases that often require 
sophisticated routines to act as intermediaries so that the search engine, and ultimately the 
user, can extract meaningfully data and information from them. The intermediaries are 
generally of two types: middleware interfaces (MWIs) and data mining routines or algorithms 
(DMRs). MWIs provide information about data in the data database, e.g. , variable lists, type 
of data preprocessing (averages, means, standard deviations, etc), data storage criteria and 
classification, etc. DMRs provide mechanisms for extracting data from the database using 
routines to further process and classify data in the database. For data mining routines to work 
properly, they need the actual records or database layout in order to construct data 
manipulations and ranking, e.g. , construction of a decision tree prior to performing a ranking 
of the data in the decision tree. MWIs exist for relational database and in a co-pending 
application, the inventor described a MWI for multi-dimensional database such as OLAP 
database, United States Patent Application Serial No. 09/713,674, filed 1 5 November 2000, 
incorporated herein by reference. 

Thus there is a need in the art for a system that will allow a user to utilize data stored 
in diverse databases more effectively and to provide the user with a method for enhancing 
and/or expanding the richness of data and/or information corresponding to or related to a 
user's query and/or to refine the query to obtain results of interest to the user. 

SUMMARY OF THE INVENTION 

The present invention relates to a method for analyzing a query and generating related 
results including determining keywords associated with the query, polling a database to 
determine terms related to the keywords, supplying the keywords and terms (all or some) to 
a data mining routine and generating a results related to the query and questions for refining, 
expanding or enhancing retrieved information. 

The present invention also provides a method for enhancing information retrieval 
content from a query including retrieving direct data responsive to the query, extracting query 
element from the query, inputting the elements to a data mining routine, and outputting 
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results from the data mining routine, where the results include related data and suggested 
questions for enhancing or refining retrieved results. 

The present invention also provides a system for enhancing query information 
retrieval content, where the system includes a remote digital processing unit (rDPU), a query 
information retrieval content enhancing server (QIRCES), a database server (DBS), an 
information infrastructure such as a local area network (LAN), a wide area network (WAN) 
or a global information infrastructure (Gil) interconnecting the rDPU and the servers. The 
rDPU includes a query generator and communication hardware and software for interacting 
with the servers over the information infrastructure. The QIRCES includes query 
information content enhancing software comprising a scheduler, a query parser, a user 
profiler, a database, a query/results database (qrDB), middleware interface (MWI), data 
mining algorithms or routines (DMRs), a library of database interfaces, an email controller, 
communication hardware and software and visualization software, and an expert. And, the 
DBS includes an informational database (iDB) and database services such as OLAP services 
for an OLAP database and SQL services. 

The present invention also provides a method for analyzing a query and generating 
related results including forming a query, inputting the query to a DB, outputting results from 
the DB corresponding directly to the query, extracting query elements from the query, where 
the element comprises keywords and optionally constraints, generating related query 
elements comprising related keywords and optionally related constraints, inputting the 
elements and/or related elements to a DMR and outputting related results and questions from 
the DMR for query information retrieval content refinement. 

The present invention also provides a method for analyzing a query and generating 
related results and refinement questions including determining query element associated with 
the query, polling a database to determine related query elements, selecting some or all of the 
elements and/or related elements, supplying the selected elements and/or related element to 
a data mining routine, generating related results and questions from the DMR for query 
information retrieval content refinement and outputting the related results and questions for 
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user interaction. 

The present invention also provides a system including a middleware interface, a data 
mining communication protocol, a database communication protocol, a query element 
classification protocol designed to determine query elements (keywords and constraints) from 
5 a query and classify the elements according to a classification protocol compatible with a 
given database, a related query element routine, which generates related query elements 
based on the element classification and interaction with the database, a communication 
protocol where the elements and related elements submitted to a data mining routine, a 
receiving routine to receive results from the data mining routine and a presentation routine 
10 where the results and questions for refining the query results from the data mining routine 
are presented to a user in a predetermined statically significant order so that the user can 
p enhance the information retrieval content of his/her original query. 
03 The present invention also provides a method for enhancing query information 

L retrieval content including: obtaining a query comprising at least one keyword and optional 
ljff constraints including a containment constraint, a grouping constraint, a connector constraint 
W or a data constraint; generating at least one related keyword and optionally related 
O constraints; obtaining results and/or information for the query "as is"; generating related 
k* results and/or information and at least one question related to the query via the operation of 
q a data mining routine; displaying the results and/or information, the related results and/or 
20" information and questions; selecting a question; generating question results and/or 
information and sub-questions; displaying the question results and/or information and sub- 
questions; and repeating the last three steps to from a query-by-question path. The method 
can also include the step of saving the path. The method can also includes comparing saved 
paths. 

25 DESCRIPTION OF THE DRAWINGS 

The invention can be better understood with reference to the following detailed 
description together with the appended illustrative drawings in which like elements are 
numbered the same: 
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Figure 1 depicts a block diagram of a preferred overall system of this invention for 
enhancing information content retrieval from a query; 

Figure 2 depicts a block diagram of a preferred embodiment of a system 
communication protocol of the system of Figure 1; 
5 Figure 3 depicts a block diagram of a preferred system architecture of the system of 

Figure 1; 

Figure 4 depicts a screen image of a preferred embodiment of a user interface of the 
present invention showing a preferred embodiment of a natural language query input screen; 
Figure 5 depicts a screen image of a preferred embodiment of a user interface of the 
10 present invention showing a preferred embodiment of a Boolean query input screen; 

Figure 6 depicts a screen image of a preferred embodiment of a user interface of the 
□ present invention showing a preferred embodiment of a search results screen; and 
0 j Figure 7 depicts a screen image of a preferred embodiment of a user interface of the 

L present invention showing a preferred embodiment of result specific screen, 
g DETAILED DESCRIPTION OF THE INVENTION 

yj The inventor has found that a system and method for enhancing retrieved 

O informational content from a query-based search format can be constructed where the system 
H and method returns not only results and/or information related directly to the query, but also 
p returns results and/or information related to or associated with the query. The inventor has 
20" found that this system and method can be implemented on a distributed digital processing 
environment, where the environment includes remote digital processing units (rDPUs) and 
server digital processing units (sDPUs or Servers) communicationally interconnected via an 
information infrastructure including a global information infrastructure (Gil) such as the 
internet or the world wide web, or a local network or LAN. 
25 The present invention broadly relates to a system and method for enhancing the results 

and/or informational content retrieved from a query, whether the query is a Boolean query 
or a natural language query. The results and/or informational content is enhanced by running 
one or more data mining routines against the query to generate related data and one or more 



Specification 

Express Mail No.iEL 389 348 146 US 

C:\WINDOWS\Desktop\My Briefcase\Anwar09UTL. PA.wpd 



Page 5 



Robert W. Strozier, p.l.lc. 



possible sub-queries that may be of interest to the user. 

The present invention broadly relates to a method for enhancing the results and/or 
informational content retrieved from a query including receiving a query, obtaining results 
and/or information directly related to the query, submitting the query to one or more data 
mining routines which generate results and/or information related to the query and generate 
one or more options or sub-queries for refining the query or for investigating results and/or 
information related to the query or that the system determines may be of interest to the user. 
The related data and sub-queries are presented to the user in a list or page format so that the 
user can review and/or investigate the data or sub-queries by clicking on a desired related 
data result or a desired sub-query. When a user selects a sub-query, the system will act on 
the sub-query and generate results and/or information related to the sub-query as well as sub- 
sub-queries based on the processing the sub-query. Thus, the user can be walked down a 
query-by-question pathway to improve result and information content derivable from any 
given query. 

The system includes routines to receive a query and to post the query "as is" to a DB. 
If the query is a natural language query, then the system includes routines to extract elements 
from the query. The system also includes routines to determine related query element based 
on the query element (keywords and connectors). Once the system has the query elements 
and related query elements, the system passes these elements to one or more data mining 
routines (DMRs), where the all of the elements, or some selected elements, are used in each 
DMR to generate related results comprising related results and/or information and one or 
more suggested sub-queries for refining and enhancing the information content derived from 
the query. The related results and suggested sub-queries can be presented to the user as an 
active list or on a page-by-page basis. Although the DMRs can communicate with databases, 
including without limitation multidimensional databases (MDDBs), relational databases, 
hierarchical databases or the like, directly, the preferred communication pathway involves 
an intermediary called a middleware interface as described in co-pending application United 
States Patent Application Serial No. 09/713,674, filed 15 November 2000, incorporated 
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herein by reference. 

The system can also include a database for storing queries and results. The system can 
also include routines for running a user profile against the stored queries and results to 
inform a user of results and/or information that the user may find of interest based on the 
user's profile. The system can also include routines for forming user displayable screens or 
pages devoted to frequently submitted queries, interesting data resulting from queries, or the 
entire contents of the query/results database. The system can also include routines for 
performing data analysis and manipulation of data in the query/result database alone or in 
conjunction with data analysis and manipulation of data from Dbs. The system can also 
include background routines that search DBs and other databases for results and/or 
information that may be of interest to users based on the profiles in the user profile database. 
The user profile database can be categorized or classified based on a scheme that groups 
users into categories or classes so that background data mining protocols can be tailored to 
derive results and/or information for all users or for each category or class. The system can 
also include user interactive procedures for ranking the relevancy of related data and sub- 
queries to further refine user profiling and enhance and enrich a user's access to results and/or 
information of interest to the user. The system can also includes an email interface for 
providing the results in an email context. 

For internet implementation, the system of the present invention comprises user 
rDPUs and sDPUs including an application server (asDPU) and a database server (dbsDPU). 
The rDPUs include a browser which is the communication conduit between the user and the 
asDPU, which is generally based on HTML or some other similar communication protocol. 
The asDPU communications with the dbsDPUs either directly or via a M WI using standard 
database communication protocols. 

Suitable digital processing units, both remote DPUs and Servers, can be any digital 
processing device including, without limitation, digital processing devices manufactured by 
Dell Corporation, Compaq Corporation, Intel Corporation, Motorola Corporation, Texas 
Instruments, Inc., IBM, AMD, Cyrix, or any other manufacturing of digital processing 
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devices. The memory can be any memory compatible with the particular digital processing 
device. 

Suitable operating systems include, without limitation, windowing operating systems, 
UNIX based operating systems or any other operating system. Suitable communication 
5 hardware and software can be any software and hardware that supports any narrow band or 
wide band communication protocols, with wide band, high speed communication protocols 
being preferred. 

Suitable data mining routines or algorithms that can be used by the system of this 
invention include, without limitations, a chi squared DMR, a correlation DMR, a decision 

1 0 tree DMR, a market basket type DMR, a naive Bayes DMR based on Bayesain statistics, an 
association DMR, a cluster DMR or other similar data mining routines or algorithms or 

p mixtures or combinations of one or more DMRs, some of which are described in co-pending 

% United States Patent Application Serial No. 09/713,674, filed 15 November 2000, 

I* incorporated herein by reference and other are well-known public domain data mining 

fg routines. 

M User Interface 

o Login Page 

11 When a user connects to the QIRCES system for the first time, the system prompts the 
%l user for a unique user ID and password, with standard password reentry to insure proper 

2ff password assignment. Once a user ID and password has been established and stored by the 
system, the next time the user connects to the system, entry of the user ID and password will 
allow the user access to the system. If the user is a Gil user using a browser to connect to 
the QIRCES server, then the user can elect to have authentication processing saved in 
browser cookie file. When the user registers (first time user), the user can elect to save the 

25 login file (user ID and password) in a cookie. Of course, the user can elect this option any 
time he/she connects to the system. If the user chooses to save her/his login profile in a 
cookie file, then the Login Page will not appear the next time the user connects to the system. 
If the user elects not to have a cookie file containing the necessary login information or if the 
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user's browser does not support cookies, then the Login Page will appear every time the user 
connects to the server and the user will have to complete the standard login procedure. 
User's Home Page 

In a preferred implementation of the QIRCES system of this invention, each user 
would have a home page on the server, which is created when a user first registers with the 
QIRCES system. Each time the user connects to the QIRCES system after registration, the 
user goes directly to his/her home page. The user can customize her/his page. The home 
page is used by the user to save results to set and modify preferences and to view postings 
from the system that fit the user's profile or that the system determines may be of interest to 
the user. 

After the user passes authentication, the browser displays the user Home Page. The 
main section of this page allows the user to navigate projects, favorites, preferences, view 
hot news, recent projects, server notifications, etc. If the browser supports frames, then the 
page can be frame-based for further convenience and functionality. This page preferably 
includes navigation and information areas or domains. The first domain or area, which can 
be located in the left 20% of the page by default (changeable), can include links to: (1) the 
main section of the user's Home Page; (2) the user's workplace; (3) a favorites page; and (4) 
the user's preferences. 

If the user has an email account on the system, the server administrator can provide 
a web-based interface to a user's mailbox. This interface can be included in the user's Home 
Page and there can be a link to this interface in the navigation area of the Home Page. The 
user's workplace link has child links to the user's recent projects, a new project wizard and 
other workplace related functions. The user's favorites page link includes user defined child 
links organized in folders that in their turn can have child links and folders. The server 
administrator can predefine some links and folders, while other can be defined by the user. 
The user's preferences link includes child links to different preference sets such as global 
preferences, mailbox preferences, query construction preferences, results preferences, etc. 
User's Workplace 
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In a preferred embodiment of the QIRCES system, the system creates for each user 
a user workplace. The workplace is used by the user to create projects that allow the user to 
gather information on an as-needed basis or a periodic basis. The system saves information 
about each user project for review, retrieval, modification, analysis or the like. Via 
5 workplace preferences, the user can choose the type of information displayed when the user 
workplace page is opened, e.g., display recent projects sorted by last access date or display 
only the latest accessed projects or display the most frequency accessed projects. The 
workplace page can also include a link to a new project wizard that allows the user to create 
a new project. When working with a project, the user can switch the workplace interface 
10 between two modes: (1) a confirmatory mode and (2) an exploratory mode. The user can 

work in either mode independently, 
r i The confirmatory mode allows the user to go directly a particular database or database 

X site such as an MDX cube and pose a query to that particular database, le. the confirmatory 
[7 mode is a single DB-single query mode. If the user wants to work with different cubes or 
W queries, the user must create a different project, one for each cube and/or query. 
03 The exploratory mode allows the user to pose a query to any number or database or 

o to all databases that are accessible to the system and contain information relevant to the 
LL posed query. The exploratory mode uses a search engines and surfer type interface. Results 
% are then displayed for the user's review. 

2tT There are two kinds of searches: Boolean searches and Natural Language searches. 

The Natural Language search is preferable English; however, the Natural Language interface 
can support other languages. When operating in English, the Natural Language search mode 
is sometimes called the English Language search mode. Boolean searching is based on a set 
of constraints. Each constraint includes a text field (keyword - word or words), a 

25 containment option (must contain, must not contain, should contain, etc.), a grouping option 
(the word(s), the phrase, etc.), a connectors connecting text fields (and, or, not, andnot, nor, 
etc.) and a data option having the following variants: (1) filter; (2) dimension member; (3) 
dimension, drilled down to the level of member; (4) member's child members; and (5) drilled 
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down parent member. Search engine results can be formatted, sorted or categorized as 
desired. 

Search Engine Interface 

In a preferred embodiment of the system of this invention, the system includes a 
5 search engine interface (SEI), which is based on popular search engine concepts such as 
those found in search engines like AltaVista, Excite, or the like. The SEI allows the user to 
pose queries in a variety of search formats including Boolean queries, Natural Language 
queries, predefined queries and DB structured queries. Using the SEI, the user will construct 
a query in a manner similar to the way the user would construct queries in a typical search 

10 engine. Once the user constructs a query, the SEI allows the user to submit the query by 
hitting enter or a search button associated with the SEL Such an SEI is described in greater 

j ;| detail in conjunction with the description of Figures 6A&B and 7A&B. 

55 Search Engine Query Result 

[7 Once the user has constructed and submitted a query, the SEI presents the results of 

fif the query and the query refining process (DMR results and sub-queries) a list format similar 
CM to results presented in a typical search engine. Although each list member includes a brief 
o textual description, it does not point to a URL as it would in a typical search engine, but 
u instead is a pointer into a particular results section of the query results as shown in Figure 
p 7 A. The first or top query result section contains results and/ or information derived from the 
2tT query "as is" along with certain obvious refinements, e.g., time, location, product, etc. 
Subsequent result sections include results and/or information from the operation of each 
DMR on the query elements and related elements. These results include simple refinements 
such as a particular type of a broad class of a keyword {e.g., coke from the keyword drink) 
as well as more complex refinements that actually amount of a new refined query or question. 
25 When a more complex refinement is selected by the user, the user will be given results and/or 
information from the refinement that can include simple refinements as well as more 
complex refinements, Le., another query or question. Thus, the user can progress down a 
query-by-question path viewing results along the way in a cross-tabulated format and a 
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graphical format as shown in Figure 7B. 
Surfer Interface 

In another preferred embodiment of the system of this invention, the interface does 
not include a search engine query construction and submission construct or includes a surfer 
switch that permits the user to toggle between the SEI and the surfer interface. The surfer 
interface permits the user to bypass the query construction and submission window and 
instead to surf and/or view results of queries that the user has already submitted or that have 
been previously submitted by other users. These results can be all results in the application 
database or a profile restricted or filtered set of results based on user preferences. These 
results of existing queries can be categorized as follows: (1) predefined queries defined by 
site administrators, database administrators or the like; (2) popular user queries; (3) queries 
that are created as a result of background data mining operations; or (4) all results in the 
system results database. 
Email Interface 

In another preferred interface, the user can construct and submit queries and await 
results via an email interface such a SMTP or WAP. Because certain queries posed by a user 
may require considerable processing time, the user can chose to submit the search and await 
results notification via the email interface. Alternatively, the user can fill out a basic 
template providing information about the type of information the user in interested in to 
create a user profile corresponding to the information entered into the template by the user. 
The user can specify what frequency of email notification the user desires, e.g., very 
frequent, frequent, or infrequent. The user will be able to fine tune the email frequency that 
is optimal for the user and the user can fine tune the content of the information the user is 
interested in. The email messages will include a result section as described in connection 
with the Search Engine Query Results section and Figure 7 A herein. Thus, the email 
interface, which can be used in conjunction with the SEI or the surfer interface or all by 
itself, allows the user access to results and/or information of interest to the user on a time 
frame definable by the user. Thus, the user can be notified by email anytime a search that 
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fits the user's profile is submitted or only when the results of a query fitting the user's profile 
includes interesting results. By interesting results, the inventor means results that show a 
high direct or inverse correlation with other data, that show data significantly impacted by 
data that fits the user's profile or any other statistically significant correlations involving data 
5 that fits the user's profile. 
Back End Processing 

In another preferred embodiment of the system of this invention, the system includes 
back end processing routines for mining the data that may be of interest to a particular user 
or to the user community in general. Thus, the system on the application server(s) can track 
10 user activity and preferences so that the system routines can better tailor results and/or 
information content for each user or the user community in general. The system will track 
r j user behavior including, without limitation, search habits, query structures, results ratings, 
% site preferences, feature preferences, and/or other personal preferences as well as user 
f * community habits including, without limitation, popular query formats, popular sites, popular 
W system components, or the like. The system uses the tracked data to improve system features 
pj and/or facilities and/or to improve retrieved informational content for the whole community, 
O a part of the community and/or a particular user in the community, 
ju Query-by-Question Pathways 

~ One powerful aspect of the system of this invention is the ability for the system to 

28" walk a user down a path of results and/or information related to or derived from a single 
query. As each DMR returns results and/or information derived from the original query and 
generates one or more sub-queries that may be of interest to the user, the user can embark on 
an exploratory survey of results and/or information derived from each sub-query and each 
sub-query generated by the DMR from a selected sub-query. Thus, the user can be directed 
25 on a question by question basis to results and/or information related many level down a 
query-by-question pathway. Of course, each pathway will be different depending on the 
particular sub-query selections made by the user. 
System Architecture 
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The system is preferably designed to run on one or more dedicated application servers 
that receive queries, retrieve direct results to the queries and trigger DMRs to ferret out 
related results and/or information. As interesting relationships are found, the system stores 
the query and the results in a database. The system periodically analyzes the database to 
5 determine whether new databased results fit a user profile and notifies the user via the email 
interface. 

One preferred architecture for the system of this invention, breaks the system into 
three basic levels: Presentation; Application and Data. 
Presentation Level 

1 0 A preferred presentation level for systems implemented on a Gil includes components 

which run on the user's rDPUs under a browser such and Internet Explore or Netscape. 
O Preferably, the browser supports HTML, DHTML, Java-script, frames, VRML and lava- 
ge applets (NN4, IE4, and VRLM plugins) or the like. Of course, the browsers support all basic 
J7 feature such as site-surfing, login, search, etc. For LAN based implementations, the rDPUs 
f§ would use any custom software for query construction and submission and LAN 
03 communications. 
O Application Level 

u A preferred application level for systems implemented on a Gil includes components 

^ running on a server under a server OS such as UNIX based operating system and NT based 
2tT operating systems, which include GII services for server to server and server to user 
connections and communications such as IIS from MicroSoft or the SMTP, WAP or similar 
protocols. Application servers are connected with the data servers via ethernet or other wide 
band data communication protocol for LAN based systems or via wide band communication 
protocol for GII implementation. Moreover, the application server and the database server 
25 can be the same server or can be implemented on the same internet site. 
Data Level 

A preferred data level for systems implemented on a GII includes software 
components running on a database server under a server OS such as UNIX based operating 
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system and NT based operating systems, which include Gil services for server to server and 
server to user connections and communications. In addition, the database server includes a 
database, which can be any type of database including, without limitations, relational 
databases or multidimensional database such as OLAP database. In addition to the OS and 
5 to other standard software, the database server will include database service software 
including database communication protocol software such as SQL software (e.g., MS SQL 
Server) and MDDB service software such as MS OLAP Services. 
Detailed Description of the Drawings 

Referring now to Figure 1, a preferred embodiment of the system of the present 
10 invention, generally 100, is shown to include a rDPU 102 which also includes an operating 
system 104, a browser 106 and communication software 108. Of course, the rDPU 102 also 
q includes standard hardware components such as a processor, memory, mass storage devices, 
m and peripherals (not shown) . The rDPU 1 02 is in two-way communication with a application 
fl server 130 via an information infrastructure such as a LAN (local area network), a WAN 
fir (wide area network) or a global information infrastructure 120 using a communication 
0' protocol 122 such as HTML, XLM, GIF, Jave3D, TCP/IP, or the like. The application server 
O 130 includes an operating system 132, active server pages 134, pivot table services 136, 
M DMRs 138, a profiler 140, a database 142, a middleware 144 and communication software 
p 146. As with the rDPU 102, the application server 130 includes standard hardware 
20 components such as a processor, memory, mass storage devices, and peripherals (not shown). 
The application server 130 is in two-way communication with a database server 160 via the 
information infrastructure 120 using a protocol 124 such as MDX or OLE DB. The database 
server 160 includes an operating system 162, services 164 including database services such 
as OLAP services associated with OLAP mulitdimensional databases and SQL services, and 
25 communication software 166. 

Referring now to Figure 2, a preferred architecture, generally 200, for the system of 
this invention is shown schematically to include a presentation level 202, abusiness level 220 
and a data level 260. The presentation level 202 involves interaction with the user at the 
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rDPU 102 of Figure 1 using a communication protocol or combination of protocols 204 such 
as HTML, DHTML, pictures, JavaScript, Java3D ? etc. over the Gil 112 of Figure 1 and also 
involves text based message receiving and sending 206. The business level 220 includes an 
IIS 222, in two-communication with an ASP 224 and a SMTP 226. The business level 220 
5 also includes a query information content enhancing sub system (QIRCES) 228 including a 
query information content enhancing controller 230, a DMR library 232, a library of database 
interfaces 234, a profile controller 236, experts 238, a communication/visualization controller 
240 and an e-mail controller 242. The ASP 224 is in two-way communication with QIRCES 
228 and a component of the ASP 224 is in two-way communication with the 
1 0 communication/visualization controller 240 of QIRCES 228. The SMTP 226 is in two-way 
communication with the e-mail controller 242. The data level 260 includes DB services 262 
Q such as OLAP services for OLAP multidimensional databases and SQL services 264. The 
ol library 234 of QIRCES 228 is in two-way communication with the DB services 262 and the 
M SQL services 264. The present structure is applicable to any DB including MDDBs, 
lf| relational databases, hierarchical database or the like and the MWI would be a middleware 
^ product designed to interface with the particular database being accessed. 
i~ = ; Referring now to Figure 3, a block flowchart of a preferred query informational 

H content enhancing method of this invention, generally 300, is shown to start with the user 
□ constructing a query or search question step 302. The query can be constructed using any 
20 type of software that is capable of interacting with a database, including without limitation, 
database front ends, a search engine accessible for a network such as a internet or intranet, 
a spread sheet program such as Quattro Pro, Exel, etc. or any other software program that 
permits query construction and submission to a database. After the query is constructed 
(generally, typed into a text box in a screen), the query is forwarded over a network in a 
25 query send step 304 to an application server that captures the query in a query capture step 
306. The application server can be a server in an internet environment like a site on the 
world wide web or a digital processing unit in an intranet or LAN. The application server 
can be the same or different from the digital processing unit or server upon which the 
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database is resident. 

Once captured, the application server determines whether the query is a natural 
language query in a conditional test step 308. If it is a natural language query, then the 
method 300 transfers control along a YES branch 310 to a pre-process query step 312, where 
5 keywords and connectors are extracted from the natural language query. Once keywords and 
connectors are extracted from the natural language query, control is transferred to a forward 
query as is to a database step 314, where results and/or information directly related to the 
query is gathered. If the query is not a natural language query, but a Boolean query or other 
query that comprises keywords and connectors, then control is transferred along a NO branch 
10 316 to the forward query as is step 314. Next or simultaneous with the as is query forward 
step 314, related keywords and/or connectors are generated in a generate step 317. Next, the 
O query components (keywords and connectors) and related components (related keywords 
pj and/or connectors) are submitted to one or more DMRs in a submit step 318. 
L The DMRs operate on the query terms to generate a request or a plurality of requests 

ljS; for results and/or information from a database in and sends the request(s) to a middleware 
yj interface which facilitates data extraction from the database in a send requests to MWI step 
O 320. For relational database, the MWI can be one of a variety of MWI products available on 
«== the open market including, without limitations, CocoBase from Thought, Inc., DataDirect 
O SequeLink from Merant, DB2 Universal Database from IBM, dbAnywhere Server from 
20 Symatec, DbGen from 2Link Consulting, Inc., and other middleware products listed at 
www.j avaworld.com/j avaworld/tools/j w-tools-datamid.html or similar internet sites. For 
multi-dimensional databases, including OLAP databases, the middleware product is 
preferably the product disclosed in co-pending United States Patent Application Serial No. 
09/713,674, filed 15 November 2000, incorporated herein by reference. 
25 Once the MWI receives the requests in a MWI receive step 322, the MWI constructs 

appropriate database requests in a construct step 324 and sends the DB requests onto the 
database in a send DB requests step 326. Once the database receives the requests in a receive 
step 328, the database constructs results corresponding to the requests in a construct step 330, 
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and sends the results onto the MWI in a send results step 332. Once the MWI receives the 
DB results in a receive step 334, the MWI reviews the results and the MWI requests from the 
DMR and determines whether any additional requests are required to complete the MWI 
requests in a conditional step 336. If additional requests are required to produce a complete 
response to the DMR requests, then control is transferred along a YES branch 338 to the 
construct step 324 which repeats steps 326-336. 

Once the conditional step 336 determines no additional data are required to complete 
theMWI requests, then control is transferred along a NO branch 340 to a post-processing 
conditional step 342, where the MWI checks to determine whether the DB responses required 
post-processing or analysis prior to construction of DMR responses. If post-processing is 
required, then control is transferred along a YES branch 344 to a post-processing step 346 
and then to a construct DMR responses step 348; otherwise, control is transferred along a NO 
branch 350 directly to the construct step 348, where the DB results and any post-processing 
of the results are set forth in responses to the DMR requests and forwarded to the DMR in 
a send step 352. Next, the DMR receives the DMR responses in a receive step 354 and 
constructs User responses in a construct step 356. The user responses are then sent and 
displayed for the user in send step 358 and display step 360, respectively. 

Referring now to Figure 4, a block flowchart of a preferred user result interaction 
method of this invention, generally 400, is shown to include a display format conditional step 
402, where the routines check to determine whether the user prefers to see a condensed list 
of the results or prefers to see the result in page format from the get go. If the user prefers 
the list format, then control is transferred along a LIST branch 404 to display results list step 
406. Once the list is displayed, the user can select a given result by clicking on the result 
selector in a select step 408. Once selected, the routine displays a page format positioned at 
the selected result in a display page format step 410, If the user prefers the page format from 
the get go, the control is transferred from the conditional step 402 along a PAGE branch 412 
to the page format step 410, except that the page is positioned at the first result instead of at 
a selected result. The results page includes results and questions related to the query 
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generated by DMRs which can be toggled on and off to allow the user to follow or construct 
a query-by-question path through the related query data. When the user selects a given 
refinement or question, that question becomes a new query, which gives rise to new results 
and new questions. This process can be continued until the user either finds the result he/she 
5 desires or determines that the path is not leading to any results of interest. The system can 
also save the query-by-question path, which can be saved simply as a composite query 
including all of the keywords and constraints associated with the final result in the path. 

Once in the page format either at the start of the results or at some selected position 
within the page displayed results, the user can select a give query refinement by clicking on 
10 a given query refinement selector in a select refinement step 414. The method 400, then 
checks to see if the selected refinement requires additional processing in a conditional step 
Cj 416. If additional processing is required, then control is transferred along a YES branch 418 
m to a goto step 420, which transfers control to step 304 of Figure 3 . After the method set forth 
L in Figure 3 completes obtaining results corresponding to the additional processing, control 
lp is transferred to a display selected refinement step 422, which is also the step to which 
^ control is transferred along a NO branch 424, if additional processing is not needed. Once 
H the selected refinement is displayed, the method 400 check to see if the user wants to exit the 
H routine in an exit test step 426. If the user does not want to exit the method, then control is 
Q transferred along a NO branch 428 to the select step 414. If the user does want to exit, then 
20 control is transferred along a YES branch 430 to an exit step 432. The user review, displays 
and analyzes the refinements and the results derived therefrom using the SET A preferred 
SEI results screen display format is shown in Figure 7A&B and described herein. 

Figure 4 also illustrates the query-by-question method of this invention. As the user 
selects a particular refinement and 
2 5 Referring now to Figure 5 A, a first preferred structure, generally 500, of this invention 

is shown to include a user interface 502, which can be any interface capable of allowing a 
user to construct a search and submit the search to a database including, without limitation, 
a spread sheet such as Excel or Quattro Pro, a database front end or any other type of query 
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construction software routine in active communication with a database. In prior art database 
searching environments, the user interface 502 would communicate directly with a database; 
however, in the structure 500 of this invention, an intermediary routine, the QIRCES system, 
is interposed between the user interface 502 and the database. The QIRCES system 504. 
The system 504 includes a QIRCES controller 506, a query processor 508, which processes 
natural language queries to extract keyword and connector, a DMR library 510 and a MWI 
512. Finally, the structure 500 includes a database 514. The user interface 502 is in two-way 
communication with the QIRCES controller 506 via communication pathway 516. The 
QIRCES components are in two way communication as shown by the pathways 518, while 
the controller 506 and the MWI 512 are in two-way communication with the database 514 
along pathways 520. The controller 506 is in communication with the database 514 to 
transmit the query as is and to receive the as is query results, while the MWI 512 is in 
communication with the database 514 to transmit and receive information required by each 
DMR in the DMR library 510. The structure 500 can be implemented on a single digital 
processing unit, but is preferably implemented on a distributed processing environment such 
as an intranet (LAN or the like) or a global information infrastructure (the internet or world 
wide web). 

Referring now to Figure 5B, a block flowchart of a preferred user interaction method 
of this invention, generally 550, is shown to include a user interface 552, which can be any 
browser software program such as Explorer from MicroSoft, Netscape from Netscape, etc. 
and a search engine program 554 such as Excite, AltaVista, Ask Jeeves, HotBot, Google, 
Lycos Search, Netscape Search, etc. In prior art search engine searching environments, the 
search engine 554 would communicate directly with a database; however, in the structure 550 
of this invention, an intermediary system, the QIRCES system 556, is interposed between the 
search engine 554 and the database. The QIRCES system 556 includes a QIRCES controller 
558, a query processor 560, which processes natural language queries to extract keyword and 
connector, a DMR library 562 and a MWI 564. Finally, the structure 550 includes a database 
566. The user interface 552 is in two-way communication with the search engine 554 via 
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communication pathway 568, which is in two-way communication with the QIRCES 
controller 556 via communication pathway 570. The QIRCES components are in two way 
communication as shown by the pathways 572, while the controller 556 and the MWI 562 
are in two-way communication with the database 564 along pathways 574. The controller 
556 is in communication with the database 564 to transmit the query as is and to receive the 
as is query results, while the MWI 562 is in communication with the database 564 to transmit 
and receive information required by each DMR in the DMR library 562. The search engine 
554 can optionally be in direct two-way communication with the database 564 via 
communication pathway 576 and in optional direct two-way communication with the MWI 
562 via communication pathway 578. The structure 550 can be implemented on any 
distributed processing environment such as an intranet (LAN or the like) or a global 
information infrastructure (the internet or world wide web), but is preferably implemented 
on a global information infrastructure. 

Referring now to Figures 6A and B, an illustrative screen image, generally 600, of a 
preferred search engine interface to the QIRCES system and/or method of this inventions is 
shown to include a main window 602. In this figure and the associated figures relating 
associated with this search engine interface, the interface is shown to operate in the Microsoft 
Internet Explorer browser. It should be recognized that other browsers can be used as well. 

The main window 602 includes a browser banner 604, browser control buttons 606, 
a set of browser pull down menus 608, a set of active browser icons 610, and an address 
display area 612 with associated pull down menu button 614 to display previously visited 
sites. The main window 602 also includes a QIRCES SEI window 620, which illustrates a 
preferred implementation of the SEI of the present invention. The QIRCES SEI window 620 
includes a SEI banner 622 and a set of link buttons 624 to difference pages within the SEI. 
The link buttons 624 include a home link button 626, a register button 628, a my page button 
630, undefined buttons 632, and a contacts button 634. The QIRCES SEI window 620 also 
includes a select criterion selector 636 with associated pull down menu button 638, a measure 
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criterion selector 640 with associated pull down menu button 642, and a data mining criterion 
selector 644 with associated pull down menu button 646. The QIRCES SEI window 620 also 
includes a query construction and submission window 648. The query window 648 includes 
a English tab 650 for entering natural language queries in English (or any other language), 
a Boolean tab 652 for entering Boolean queries, a predefined tab 654, where the user can 
form one or more predefined queries or can select from a list of predefined queries, a DB 
structure tab 656, where the user can enter queries that are structured for direct interaction 
with a given database schema and a show field 658 and pull down button 660 for controlling 
the number of result shown per page in the result windows described herein. The English 
tab 652 includes a query entry field 662 with associated scroll controls 664 and search submit 
button 666. 

Looking at Figure 6B, the window 648 is shown with the Boolean tab 652 activated. 
In the Boolean query construction window format, the window 648 includes a first term entry 
field 668 with an associated Boolean keyword (word or phrase) entry field 670 and pull down 
menu button 672 and an associated entry type field 674 and pull down menu button 676. The 
window 648 also includes a second term entry field 678 with an associated Boolean keyword 
control field 680 and pull down menu button 682 and an associated entry type field 684 and 
pull down menu button 686. The window 648 also includes a Boolean keyword connector 
field 688 and associated pull down menu button 690 and a add button 692 to add additional 
keywords or terms to the query. 

Referring now to Figure 7A, the screen image 600 is shown displaying search results 
and includes the result window 700 having a banner 702 and including the search 704 to 
which the results apply. The window 700 also includes a first results section 706, which 
includes a set of descriptor fields 708 that correspond to the query keywords used in the 
search with associated toggle or check boxes 710 to toggle the keywords on or off. The first 
results section 706 also includes a set of proposed refinements 712 and associated toggle or 
check boxes 714 for turning the refinements on and off. The refinements 712 are simple 
refinements based on the query "as is" and not from DMR processing. The section 706 also 
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includes an open button 716, a database identifier field 718, a cube identifier field 720 and 
a measure identifier field 722. The result first section 706 includes information and 
refinements that related directly to the query and do not include related or enhanced results 
and/or information retrieval refinements that are generated via the operation of a DMR. The 
5 remaining results section are results that are derived from the operation of a DMR on the 
query. Thus, a second and third results sections 730 and 732 include results from different 
DMRs. 

The second and third results sections 730 and 732 include a set of primary result 
identifiers 734 with associated toggles or check boxes 736 some of which are turn on and 
10 some of which are turned off, and a set of refinements 738 with associated toggles or check 
boxes 740. The user can turn toggles or check boxes on or off and then open a given result 
it by hitting the open button 716 associated with the result section of interest. Once the user 
f f activates a result by hitting the open button 716 associated with a particular result section, 
ifl the SEI actives a detailed results screen. 

1 S3 Looking at Figure 7B, an illustrative screen image, generally 750, is shown containing 

3 detailed results and offering the user an opportunity to review the results in a cross-tab 
HI representation and a graphical representation. The screen 750 includes a detailed results 
SI window 752. The window 752 includes an active cross-tab 754 displaying cross tabulated 
yj data 756 relating to geographical categories 758, drink categories 760 and years 762. The 
20 window 752 also includes a graph 764 showing the displayed cross tabulated data in 
graphical form. The window 752 also includes a "more like this" active field 766, which 
sends a request to the QIRCES system to retrieve results like this in the QIRCES database. 
The window 752 also includes a "use as a template" active field 768 and a "save this query 
in my home page" active field 770, where the use a template field 768 instructs the QIRCES 
25 system to use the query refinement as a template for future queries and the save field 770 
instructs the QIRCES system to add the query to the user's home page for latter review. 

The window 752 also includes a first button 772 for going to the first result, a 
previous button 774 for going to the previous result, a next button 776 for going to the next 
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result, and a last button 778 for going to the last result. The window 752 also includes a 
rating or ranking protocol 780 shown here to include a Relevant button 782, a Neutral button 
784 and a Not Relevant button 786 with a rank result button 788. 

All references cited herein are incor porated by reference. While thi s invention has 
been described fully and completely, it should be understood that, within the scope of the 
appended claims, the invention may be practiced otherwise than as specifically described. 
Although the invention has been disclosed with reference to its preferred embodiments, from 
reading this description those of skill in the art may appreciate changes and modification that 
may be made which do not depart from the scope and spirit of the invention as described 
above and claimed hereafter. 
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